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1 INTRODUCTION 



Data from IRAS (the Infra-Red Astronomical Satellite) al- 
lows unparalleled uniformity, sky coverage and depth for 
mapping the local galaxy density field. In 1992, with com- 
pletion of the QDOT and 1.2 Jy surveys (Rowan- Robinson 
et al. 1990a, Lawrence et al. 1999, Strauss et al. 1990, 1992, 
Fisher et al. 1995), and with other large redshift surveys in 
progress, it became clear that a complete redshift survey of 
the IRAS Point Source Catalog (the PSC, Beichman et al. 
1984, henceforth ES) had become feasible. 

Our specific targets for the PSCz survey were two-fold: 
(a) we wanted to maximise sky coverage in order to predict 
the gravity field, and (b) we wanted to obtain the best pos- 
sible completeness and flux uniformity within well-defined 
area and redshift ranges, for statistical studies of the IRAS 
galaxy population and its distribution. The availability of 
digitised optical information allowed us to relax the IRAS 
selection criteria used in the QIGC (Rowan- Robinson et al. 
1990b), and use optical identification as an essential part 
of the selection process. This allowed greater sky coverage, 
being essentially limited only by requiring that optical ex- 
tinctions be small enough to allow complete identifications. 
The PSC was used as starting material, because of its supe- 
rior sky coverage and treatment of confused and extended 
sources as compared with the Faint Source Survey (Moshir 
et al. 1989). The depth of the survey (0.6 Jy) derives from 
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the depth to which the PSC is complete over most of the 
sky. 

The topology of the survey is analysed and presented 
by Canavezes et al. (1998); the inferred velocity field by 
Branchini et al. (1999); the real-space power spectrum and 
its distortion in redshift-space by Tadros et al. (1999); the 
redshift-space power spectrum by Sutherland et al. (1999). 
The direction and convergeance of the dipole has been in- 
vestigated by Rowan- Robinson et al. (1999), and its impli- 
cations for cosmological models by Schmoldt et al. (1999). 
Sharpe et al. (1999) presented a least-action reconstruction 
of the local velocity field, while an optical/IRAS clustering 
comparison was presented by Seaborne et al. (1999). Many 
of these results are summarised in Saunders et al. (1999). 



2 CONSTRUCTION OF THE CATALOGUE 
2.1 Sky coverage 

We aimed to include in the survey all areas of sky with 
(a) reliable and complete IRAS data, and (b) optical extinc- 
tion small enough to allow reliable galaxy identifications and 
spectroscopic followup. We defined a mask of those parts of 
the sky excluded from the survey; the final mask was the 
union of the following areas: 

(i) Areas failing to get 2 Hours-Confirming coverages 
(HCONs, ES III.C.l). In these areas, there is either no data 
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at all or the data does not allow adequate source confirma- 
tion. 

(ii) Areas flagged as High Source Density at 12, 25 or 
60^im. HSD at 12 or 25 implies an impossibly high stellar 
density for galaxy identifications. Areas flagged as HSD at 
60^im were processed differently in the PSC, with complete- 
ness sacrificed for the sake of reliability. 

(iii) Areas with Tioo > 25 MJy ster -1 . The 100/im inten- 
sity values are those of Rowan- Robinson et al. (1991). Above 
this value, we found the fraction of sources which were iden- 
tifiable as galaxies dropped dramatically. 

(iv) Areas with extinctions Ab > 2 m , where secure op- 
tical identifications become increasingly difficult. Our ex- 
tinction estimates were based on the Jioo values above, and 
incorporated a simple model for the the variation in dust 
temperature across the Galaxy; they are discussed further 
in Section 4.3. Except towards the centre and anti-centre, 
this criterion and the previous one are almost equivalent 

(v) Small patches covering the LMC and SMC, defined 
by Jioo > 10 and 5 MJy ster" 1 respectively. In these ar- 
eas, there are large numbers of HII regions in the clouds 
themselves with similar optical and IRAS properties to back- 
ground galaxies, making identifications very uncertain. 

The mask is specified as a list of excluded 1 deg 2 'lune- 
bins' (ES Ap.X.l), so these values are necessary averages 
over 1 deg 2 . 

The overall area outside the mask is 84% of the sky. For 
statistical studies of the IRAS galaxy population and its dis- 
tribution, where uniformity is more important than sky cov- 
erage, we made a 'high |6|' mask, as above but including all 
areas with Ab > l m , and leaving 72% of the sky. In practice, 
this criterion is almost identical to iioo > 12.5 MJy ster -1 . 
Henceforth, when we say 'high-latitude', we simply mean 
outside this mask. Both masks are shown in Figure 1. 

2.2 PSC selection criteria 

Our aim was to relax the criteria of the QIGC sufficiently 
to pick up virtually all galaxies, even at low latitude, purely 
from their IRAS properties; simultaneously, we wanted to 
keep contamination by Galactic sources to a reasonable 
level. Our actual colour selection criteria were almost the 
same as the QIGC: 
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However, unlike the QIGC, upper limits were used only 
where they guaranteed inclusion or exclusion. We did not at 
this stage exclude any source solely on the basis of an iden- 
tification with a Galactic source. Because the PSCz evolved 
from the QIGC, there remain in the catalogue 3 galaxies 
which actually fail the new selection criteria. In total, we 
selected 16422 sources from the PSC. 

Many very nearby galaxies have multiple PSC sources, 
associated with individual starforming regions. 70 such 
sources were pruned to leave for each such galaxy, the single, 
brightest PSC source. Also at this stage, Local Group galax- 
ies were excised from the catalogue, and a separate catalogue 
of far-infrared properties for Local Group galaxies made. 



2.3 Extended sources 

All IRAS surveys are bedevilled by the question of how to 
deal with galaxies which are multiple or extended with re- 
spect to the IRAS 60/^m beam. The size of most 60/im detec- 
tors was 1.5' in-scan by 4.75' cross-scan. The raw data was 
taken every 0.5', and for the PSC this was then filtered with 

an 8-point zero-sum linear filter, of the form h + + H • 

Hence galaxies with in-scan far-infrared sizes larger than 
about 1.5', will have their fluxes underestimated in the PSC. 
There is a lesser sensitivity to cross-scan diameter, caused 
by some of the scans only partially crossing the full width 
exteneded sources. 

The approach we have settled on is to preferentially 
use PSC fluxes, except for sources identified with individ- 
ual galaxies whose large diameters are likely to lead to sig- 
nificant flux underestimation in the PSC. The PSC flux is 
based on a least chi-squared template fit to the point-source- 
filtered data-stream, so the flux is correctly measured for 
slightly extended sources, with diameters smaller than about 
1.5'^. As far-infrared diameters are typically half optical, 
(e.g. Rice et al. 1988), and optical D25 diameters are typ- 
ically several times the FWHM for spirals, we can expect 
that isolated galaxies must have extinction-corrected diam- 
eters larger than several arcmin for their fluxes to be badly 
underestimated by the PSC. 

Fluxes for extended sources can be derived using the 
ADDSCAN (or SCANPI) software provided at IPAC, which 
coadds all detector scans passing over a given position. We 
have tested how the ratio of PSC-to-extended flux depends 
on D25, and, as expected, we find that the ratio is almost 
constant for galaxies up to 2.5' (Figure 2), where the addscan 
flux is typically 10% larger. However, we also find that 
addscan fluxes are systematically 5% larger than PSC fluxes, 
even for much smaller galaxies. This discrepancy is not due 
to any difference in calibration (we have tested this by ex- 
tracting a point-source-filtered flux from the addscan data 
and substituting this for the PSC flux, but the discrepancy 
in Figure 2 remains). The reason is the large number of mul- 
tiple and/or interacting galaxies seen by IRAS, for which the 
addscan picks up the entire combined flux. Using such com- 
bined fluxes throughout is not a consistent approach, since 
nearby interacting galaxies will always be resolved into two 
or more sources, while more distant ones will not be^. The 
effect of the PSC filter is more subtle: a close pair of galaxies 
unresolved by the IRAS beam will in general have unequal 
fluxes, and in a flux limited survey, at least one will be below 
the flux limit. In such a situation, the zero-sum linear PSC 
filter still overestimates the flux of the brightest source on 
average (because of clustering), but by much less than the 
addscan. 

Very large galaxies will have fluxes underestimated by 
the addscan, because their cross-scan extent is large com- 
pared with the detector size. Fluxes for galaxies with optical 

* By comparison, the Faint Source Survey (Moshir et al. 1989) 
fluxes are peak amplitudes of the point-source-filtered data, so 
are much less tolerant of slightly extended sources, 
t In any survey with unresolved sources, a consistent approach to 
multiple sources is not possible, except by artificially degrading 
the resolution for each source to make it the same in physical 
units as for the most distant. 
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Figure 1. 15431 PSCz sources identified as possible or definite galaxies, high |b| and default masks in galactic coordinates. 



diameters -D25 > 8' have been taken from Rice et al. (1988), 
who made coadded maps for each such galaxy. In principal, 
galaxies somewhat smaller than this may have underesti- 
mated fluxes from the addscans. However, we find that, for 
the 19 PSCz galaxies in Rice et al. with quoted diameters 
8' — 10', the flux difference between addscan and Rice et 
al. fluxes is log 10 (S 6 0A / S 6 or) = 0.044 ± 0.034, that is the 
addscan fluxes are slightly but not significantly larger; so 
we do not expect smaller galaxies to have underestimated 
addscan fluxes. 

Bearing these points in mind, we proceeded as fol- 
lows: we made a catalogue, with the same sky coverage 
as the PSCz, of optically-selected galaxies from the LEDA 
database (Paturel et al. 1989) with extinction-corrected D25 
diameters larger than 2.25', where the extinctions were es- 
timated as per Section 4.3, and the consequent corrections 
to the diameters are as given by Cameron (1990). In Saun- 
ders et al. (1995), it was argued that LEDA is reasonably 
complete to this limit. For the largest sources, we used 
the positions and fluxes of Rice et al. (1988). IPAC kindly 
addscanned all the remainder for us to provide coadded data. 
We then used software supplied by Amos Yahil to extract 
positions, fluxes and diameters from this data, on the as- 
sumption that the galaxies have exponential profiles. Where 
these addscan fluxes are used in the catalogue, they have 
been arbitrarily decreased by 10%, to bring them statisti- 
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Figure 2. Ratio of extended flux to PSC flux (filled circles) or 
point-sourcc-filtcrcd addscan flux (open circles), as a function of 
optical diameter. The open circles are displaced slightly to the 
right for clarity. The inner error bars are the uncertainty on the 
ratio, the outer error bars are the population scatter. Where D25 
is not known, it is set to 0. 

cally into line with the PSC fluxes at the 2.25' switchover. 
We ended up with 1402 sources associated with large galax- 
ies entering the catalogue, and 1290 PSC sources associated 
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with large galaxies flagged for deletion. The latter are kept 
in the catalogue, so that a purely IRAS-derived catalogue 
can be extracted if wanted. 



2.4 Other problem sources 

In the QIGC, sources flagged as associated with confirmed 
extended sources, or with poor Correlation Coefficient, or 
flagged as confused, were addscanned. The above prescrip- 
tion should have dealt with the extended sources; the only 
other reasons for a galaxy source to have a poor CC are 
that (a) it is multiple, and as argued above we prefered PSC 
fluxes in these cases, or (b) it has low S/N, in which case the 
PSC is an unbiased flux estimator, while addscanning risks 
noise-dependent biases. The confusion flag is set very conser- 
vatively (i.e. most sources flagged as confused have sensible 
PSC fluxes) and in any case the PSC in general deals better 
with confusion than the addscans, in both cross-scan and in- 
scan directions. For these reasons, and to avoid introducing 
any latitude-dependent biases into the catalogue, we opted 
to use PSC fluxes throughout for sources not identified with 
optical galaxies as above. Addscan fluxes are included in 
the catalogue for all sources for those wishing to experiment 
with them. 



2.5 Supplementary sources 

For a source to be accepted into the PSC, it required suc- 
cessful Hours-Confirmation on two separate HCONS (ES 
V.D.6). Thus at low S/N, areas of the sky with only 2 
HCONS are inevitably less complete than those with 3 or 
more; the estimated completeness at 0.6 — 0.65 Jy in 2HCON 
areas is only 82% (ES XII. A. 4). To improve this complete- 
ness, we supplemented the catalogue with 1HCON sources 
satisfying our colour criteria in the Point Source Catalog Re- 
ject File (ES VII. E.l), where there is a corresponding source 
in the Faint Source Survey. We demanded that PSC and FSS 
sources be within each other's 2a error ellipse, and that the 
fluxes agree to within a factor of 1.5. This revealed many 
sources in the Reject File where two individual HCON de- 
tections had failed to be merged in the PSC processing (ES 
XII. A. 3), as well as sources which failed at least one HCON 
for whatever reason. New sources were created or merged 
with existing ones, and the Flux Overestimation Parame- 
ters (ES XII. A. 1) assigned or recalculated accordingly. As 
well as these 1HCON sources, we also searched in the Reject 
File for additional sources with flux quality flags 1122 and 
1121 0; neither of which category made the PSC. Finally, we 
looked for sources in the PSC itself with flux quality flags 
1113, but correlation coefficient at GOfim equal to A,B or 
C indicating a meaningful detection. Altogether we found 
an additional 323 galaxies, mostly in 2HCON regions, and 
made 143 deletions as a result of merging individual HCON 
detections. 



+ l=upper limit only, 2=moderate quality detection, 3=good de- 
tection, in each of the four bands 



2.6 Optical identifications 

Optical material for virtually all sources was obtained from 
COSMOS or APM scans, including new APM scans taken 
of 150 low-latitude POSS-I E plates. In general, we used red 
plates at |6| < 10° and blue otherwise. The actual categories 
of optical material are: 

1. Uncalibrated POSS-O plates scanned with APM. Here the 
magnitudes are typically good to 0.5 m for faint (B ~ 19 m ) 
galaxies. 

2. SRC J plates scanned with APM. These plates were 
scanned and matched for the APM survey, and have bj mag- 
nitudes typically good to 0.25 m for faint galaxies. 

3,4,5. Uncalibrated SRC J or EJ plates scanned with COS- 
MOS, giving bj accurate to about 0.5 m . 

6. SRC SR plates scanned with COSMOS. The quoted mag- 
nitudes are r-magnitudes, good to 0.5 m . 

7. POSS-E plates scanned with APM. For these plates, the 
standard POSS-O calibration was assumed. Since the O 
plates are about 1 magnitude deeper than the E plates, while 
B — R^l for IRAS galaxies, this gives a reasonable approx- 
imation to the B magnitude. When there is extinction, this 
procedure gives systematically too bright a magnitude by 
about OASAb (assuming Mathis 1990). The scatter is still 
dominated by the 0.5 m zero-point error. 

At bright magnitudes, photographic photometry for 
galaxies becomes increasingly uncertain because of non- 
linearity and saturation in the emulsion. 

The image parameters from COSMOS and APM data 
were also used to get arcsecond positions and offsets from 
nearby stars, and to make 'cartoon' representations of the 
4' x 4' field centred on each IRAS source. The identifications 
were made using the likelihood methods of Sutherland and 
Saunders (1992). In general, and always at low latitudes, 
these cartoons were supplemented by grayscale images from 
the Digital Sky Survey and/or inspection of copy plates, and 
any change in best identification noted. 

At this stage, non-galaxies were weeded out by a com- 
bination of optical appearance, IRAS colours and addscan 
profiles, VLA 20cm maps from the NVSS survey (Condon 
et al. 1998), millimetre data from Wouterlout and Brand 
(1989), SIMBAD and other literature data. Several hun- 
dred low-latitude sources were also imaged at K' , as part 
of the extension of the PSCz to lower latitudes (Saunders et 
al. 1999). We found a total of 1376 confirmed non-galaxies. 
We are left with 15332 confirmed galaxies, and a further 79 
sources where no optical identification is known but there 
is no clear Galactic identification either. The distribution of 
identified galaxies and the mask are shown in Figure 1. 

The Galactic sources are dominated by infrared cirrus 
(774 sources). However, there are also 88 planetary nebulae, 
140 emission-line stars, 24 sources identified with Galactic 
HII regions, 212 sources identified with bright stars or re- 
flection nebulae, and 138 sources associated with YSO's. 



3 THE REDSHIFT SURVEY 

An essential part of the project was maintenance of a large 
database of redshift information from the literature, from 
databases such as the NED and LEDA (the NASA and 
Lyons Extragalactic Databases) and Huchra's ZCAT, and 
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also work in progress for other surveys. Redshifts were ac- 
cepted on the basis that their claimed error was better than 
any other available for that source, including our own. 

Of the 15,411 galaxies in the sample, about 8,000 had 
known redshift at the inception of the project, with about 
another 2,000 expected to be observed as part of ongoing 
projects such as the CfA2 and SSRS surveys. The project 
was fortunate enough to be allocated 6 weeks of INT+FOS 
time, 1 week INT+IDS, 6 nights AAT+FORS time, 18 
nights on the CTIO 1.5m, two weeks at the INAOE 2.1m and 
120 hours at Nangay radiotelescope, over a total of 4 years. 
4600 redshifts were obtained in this time. For the INT+IDS 
and CTIO spectra, redshift determination was made from 
the average observed wavelengths of the Ha , Nil and SII 
features. The rest of the data was taken with low dispersion 
spectrographs with resolutions around 15-20 Angstroms, 
and the Ha /Nil lines are blended together, as are the two 
SII lines. We modelled each continuum-subracted spectrum 
as a linear combination of Ha , Nil and SII features, with 
redshift as a free parameter. The model giving the small- 
est \ 2 versus the data gave the redshift and its error, also 
the various linestrengths and a goodness of fit. Spectra with 
poor x 2 ! large redshift uncertainty or unphysical line ra- 
tios were checked by hand and where necessary refitted 
with different FWHM or initial guesses. Wavelength cali- 
bration included cross-correlation of each sky spectrum with 
a well calibrated template. The final derived errors aver- 
age 120 kms -1 , as compared with 300 kms -1 for the QDOT 
survey. Further details are presented in Keeble (1996), and 
Oliver et al. (1999). 

By construction, we have minimised the overlap with 
other redshift surveys, but there are to date 448 galax- 
ies for which we have both our own measurement, and 
higher resolution measurements by other workers (Figure 
3). There are 31 sources where the velocity difference is 
more than 3a (errors combined in quadrature), if these are 
clipped out, the remaining 417 sources give an average offset 
Vpscz — Vut = — 6.1±6.1 kms -1 , with a scatter of 124 kms -1 
and a reduced \l = 0.89 . The scatter includes the contri- 
bution from the non-PSCz redshift, and suggests that our 
quoted errors are good estimates of the genuine external er- 
ror. Of the 31 discrepant redshifts, 5 involve HI observations 
with unrealistically small quoted errors and with absolute 
differences less than 250 kms -1 , a further 11 are between 
100 and 600 kms -1 and 3 — 5a, and suggest a non-Gaussian 
error distribution, and 15 are greater than 700 kms -1 and 
5a and must represent different galaxies or incorrect line 
identification or calibration. We thus have 15 redshifts se- 
riously in error out of a total of 896 recent measurements 
(both ours and other workers) giving an error rate of 1.7%. 

We did not pursue very faint (bj > 19. 5 m ) galaxies, 
on the basis that (a) they were certain to be at distances 
too large to be included for either statistical or dynami- 
cal studies, and (b) very large amounts of telescope time 
would be needed to achieve useful completeness for these 
galaxies. There are 438 sources without known redshift with 
a clear or probable faint optical galaxy identification (with 
bj > 19. 5 m ), and a further 127 with no obvious optical iden- 
tification but no secure identification as a Galactic source 
either. At time of writing we are still lacking redshifts for 
189 brighter (bj < 19.5 m ) galaxies. The sky distribution of 
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Figure 3. PSCz versus literature redshifts. Error bars show com- 
bined errors. 

these categories is shown in Figure 4. Redshifts are available 
for 14677 sources. 

3.1 n(z) and selection function 

The n(z) distribution is shown in Figure 5. The source den- 
sity amounts to 1460 gals ster -1 at high latitudes and the 
median redshift is 8500 kms -1 . To account for the effect of 
the flux limit on the observed number density of galaxies as 
a function of distance, we need to know the selection func- 
tion ip(r), here defined as the expected number density of 
galaxies in the survey as a function of distance in the absence 
of clustering. We have derived this both non-parametrically 
and parametrically using the methods of Mann, Saunders 
and Taylor (1996). This method is almost completely in- 
sensitive to the assumed cosmology, in the sense that the 
derived expected n(z) is invariant. The resulting selection 
function can be transformed to other cosmologies or defini- 
tions of distance simply by mapping the volume element or 
distance, keeping n(z)dz invariant. 

For simplicity, and to allow comparison with simu- 
lations, we assume for derivation purposes a Euclidean 
Universe without relativistic effects, and with distance r 
equal to U/lOOfr kms -1 . We derive the result both non- 
parametrically, and parametrically using the double power- 
law form 

«'>-*-G)'">G)T W) 

which very well describes the non-parametric results. The 
parameters t/>, a., r», 7 and /3 respectively describe the nor- 
malisation, the nearby slope, the break distance in ft -1 Mpc, 
its sharpness and the additional slope beyond it. 
Using the high-latitude PSCz to derive the selection func- 
tion, and correcting redshifts only for our motion with 
respect to the centroid of the local group, V = Vhei + 
300sinicos6, we obtain 

i/>* = 0.0077, a = 1.82, r, = 86.4, 7 = 1.56, (3 = 4.43 
Both non-parametric and parametric results are shown 
in Figure 6. The uncertainty in the selection function is 
less than 5% for distances 30 - 200/i -1 Mpc, and 10% for 
10 — 300 /i _1 Mpc, although the selection function drops by 
4 decades over this range. 
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PSCz galaxies without redshift 
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Figure 4. Sky distribution of PSCz sources without redshift, with galaxy identifications bj < 19. 5 m (open circles, 192 sources) and 
fainter than this or with no identification (crosses, 542 sources). 
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Figure 5. n(z) distribution for high latitude PSCz survey. Error 
bars are J3-weighted. The line is the prediction from the selection 
function. 

In Figure 6, we also show the non-parametric selection func- 
tion derived by the same method, for the QDOT and 1.2 Jy 
surveys. 



Selection function for PSCz. QDOT and 1.2Jy surveys 
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Figure 6. Parametric and non-paramctric selection functions for 
high latitude PSCz survey (line and filled circles). Also shown 
are the QDOT (open circles) and 1.2Jy (crosses) non-paramctric 
selection functions. 

4 RELIABILITY, COMPLETENESS, 
UNIFORMITY, FLUX ACCURACY 

The utility of the PSCz for cosmological investigation de- 
pends on its reliability, completeness and uniformity. 
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4.1 Reliability 

The major sources of unreliability in the catalogue are 
(a) incorrect identifications with galaxies nearby in angu- 
lar position, and (b) incorrect identification of spectral fea- 
tures. Later cross-referencing used increasingly sophisticated 
methods based on Sutherland and Saunders (1992), but 
many older identifications depend on simple 2' proximity. 
The most recent cross-correlation with ZCAT provided 1975 
updated velocities, of which 84 were altered by more than 
500 km s -1 and 50 by more than 1000 km s -1 , suggesting an 
error rate of 2%, in agreement with the analysis of Section 
3. 

The identifications made for our own redshift followup were 
done much more carefully than those taken from the liter- 
ature, using a likelihood approach including full knowledge 
of the error ellipse, optical galaxy source counts etc, and 
checked by eye from a greyscale image. The number of am- 
biguous identifications is negligible, though the PSC source 
is often confused between two galaxies separated by a few 
arcmin in the cross-scan direction. Where there was real am- 
biguity, we took redshifts for both galaxies and used the one 
with larger Ha flux. It is also possible that Galactic sources 
were occasionally identified as background galaxies, but we 
believe this to be very rare. 

4.2 Completeness 

Incompleteness enters into the catalogue for many reasons: 

1) There will be IRAS incompleteness where sources with 
true 60/im flux greater than 0.6 Jy fail to appear in the cat- 
alogue. The largest source of this incompleteness is the ar- 
eas of sky with only 2HCONs, where the PSC incomplete- 
ness is estimated as 20% (differential) and 5% (cumulative) 
at 0.6 Jy (ES XII. A. 4) . Based on source counts, we esti- 
mate that our recovery procedure (Section 2.5) for these 
sources has reduced the overall incompleteness in these ar- 
eas from 5% to 1.5%; Figure 7 shows the source counts for 
high-latitude 2HCON sky. Our recovery procedure was im- 
possible for |6| < 10°, so lower-latitude 2HCON sky (2% of 
the catalogue area) retains the PSC incompleteness. 

2) The PSC is confusion limited in the Plane. However, by 
construction, the PSCz does not cover any areas with High 
Source Density at 60/im (as defined in ES V.H.6), and the 
mask denned at lOO^im effectively limits the number of con- 
fusing sources. Also, at low latitudes, there are known prob- 
lems in the PSC with noise lagging. However, the source 
counts shown in Figure 8 show low-latitude incompleteness 
to be small down to 0.6 Jy. 

3) Some galaxies are excluded by our colour criteria. Cool, 
nearby galaxies may fail the 100/60/im condition, but will 
normally be included separately as extended sources. From 
the comparison with the 1.2 Jy survey, we estimate that 
about 50 galaxies from the PSC have been excluded (see 
Section 5). 

4) No attempt was made to systematically obtain redshifts 
for galaxies with bj > 19. 5 m . At high latitudes, the work of 
e.g. Clements et al. (1996) shows that these will in general be 
further than z > 0.1. At lower latitudes, incompleteness cuts 
in at lower redshifts. It was originally hoped that the PSCz 
would be everywhere complete to z — 0.05, but it is clear 
from the 3D density distributions that there is significant 



(10% or more ) incompleteness down to z=0.04 towards the 
anti-centre. The reasons for this are discussed in the next 
subsection. 

Patchy extinction can lead to higher local extinction than 
our values, which are averages over 1 deg 2 bins. We have ob- 
tained K' images of most low-latitude sources without obvi- 
ous galaxy or Galactic identification; many faint galaxies are 
revealed, but only a handful of nearby ones. Spectroscopy 
of the missing low-latitude galaxies is continuing as part of 
the Behind The Plane extension to the PSCz survey; for the 
time being, we estimate that good completeness has been 
achieved for z < 0.1/10 (0 ' 2(Ab+a b /10)) . The incompleteness 
is strongly concentrated towards the anticentre and at low 
latitudes; for \b\ > 10°, the survey is estimated to be useably 
complete to z = 0.05 everywhere. 

The number of galaxies with bj < 19. 5 m for which red- 
shifts are still unknown is 192, or 1.2% of the sample. A 
further 85 have only marginal redshift determinations and 
some of these will be incorrect. Both unknown and marginal 
redshifts are well distributed round the sky, but with some 
preference for lower latitudes. 



4.3 Extinction maps 

Our extinction maps started from the 7ioo intensity maps of 
Rowan-Robinson et al. (1991), binned into lune bins, with 
the Iioo/Av ratio as given by Boulanger and Perault (1988), 
and Ab/Av ratio as given by Mathis (1990). Because the 
dust temperature declines with galactic radius, this proce- 
dure over-estimates extinctions towards the galactic centre, 
and under-estimates them towards the anti-centre. We at- 
tempted to correct for this dependence of dust temperature 
on position by devising a simple model for the dust and 
starlight in our galaxy. We assumed a doubly exponential, 
optically thin distribution of dust and stars, with standard 
IAU scalelengths for the solar radius and stellar distribution 
(r = 8.5 kpc, r sc = 3.5 kpc, z sc = 0.35 kpc) and an assumed 
the dust to have the same radial and half the vertical scale- 
length. The dust properties were assumed to be uniform, and 
we assumed a dust temperature of 20K at the solar radius, 
and proportional to the one fifth power of the local stellar 
density elsewhere. We then found, for each position on the 
celestial sphere, the ratio of 100/im emission to column den- 
sity of dust, and normalised this ratio by the Boulanger and 
Perault value for the NGP. 

Subsequent to our definition of the PSCz catalogue and 
mask, Schlegel et al. (1998) used the IRAS ISSA and COBE 
DIRBE data to make beautiful high resolution maps of the 
dust emission and extinction in our Galaxy. Comparison of 
our own maps and those of Schlegel et al. show that (a) our 
temperature corrections across the galaxy are too small, and 
(b) there are several areas where cooler (16K) dust extends 
to \b\ ~ 30°. 

Overall, our extinction estimates may be in error by a factor 
1.5-2, in the sense of being too low towards the anticentre 
and too high towards the galactic centre. This has two prin- 
cipal effects, (a) Towards the anticentre, galaxies may fall 
below the bj = 19. 5 m limit because of extinction. A subse- 
quent program of if'-imaging has revealed a handful of extra 
nearby galaxies, and a significant number at redshifts 0.05 
and greater, (b) The definition of the optically-selected cat- 
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alogue in Section 2.3 depends on the extinction corrections. 
We will have selected too many optical galaxies towards the 
centre and too few towards the anticentre. However, our 
matching of addscan and PSC fluxes was designed to be ro- 
bust to exactly this sort of error. To test for any effect, we 
have compared the simple, number-weighted dipole of the 
surface distribution of PSCz galaxies, (a) using the normal 
catalogue and (b) using purely PSC-derived fluxes as de- 
scribed in section 2.3. The dipole changes by 1° in direction 
and 2.5% in amplitude when we do this, showing that any 
bias caused by incorrect extinctions is negligible. 

4.4 Flux accuracy and uniformity 

The error quoted for PSC 60/xm fluxes for genuine point 
sources is just 11%(ES VII. D. 2). We are more concerned 
with any non-fractional random error component, since 
this may lead to Malmquist-type biases. The analysis in 
Lawrence et al. (1999), based on the 12/60/im colours of 
bright stars, finds an absolute error of 0.059 ± 0.007 Jy, 
in addition to a fractional error of 10%. Stars are better 
point sources than galaxies, so this may underestimate the 
absolute error for our sample. We have made an indepen- 
dent estimate, by investigating the scatter between our PSC 
and addscan fluxes. Of course PSC and addscan fluxes start 
from the same raw data, but the processing and background 
estimation are entirely different, while the actual photon 
noise is negligible. The absolute component of the scatter is 
0.06 — 0.07 Jy, of which an estimated 20% comes from the 
error in the addscan flux - confirming that the Lawrence et 
al. value is a reasonable estimate of the PSC absolute error 
component. 

This error estimate leads to Malmquist biases in the source 
densities of order 5-6% (differential, at the flux limit) and 
2-3% (cumulative) (Murdoch, Crawford and Jauncey 1973). 
2HCON sky may have biases as large as 10% (differential) 
and 4% (cumulative), so the non-uniformity introduced into 
the catalogue should be no worse than 5% (differential) and 
2% (cumulative) . This is borne out by the source counts in 
Figures 7 and 8, but note that in any noise-limited catalogue 
such as the PSC, where the noise varies across the catalogue, 
there will always be a regime where Malmquist biases are 
masked by incompleteness. 

Lawrence et al. (1999) find evidence for slight non-linearity 
in the PSC flux scale, but this should not affect any analy- 
sis except evolutionary studies. The effect on evolution was 
considered by Saunders et al. (1997). 

Non-uniformity can also come about as a result of simple 
changes in the absolute flux scale. A small fractional error in 
the calibration will on average lead to an error half again as 
large in the source density. There are three obvious reasons 
for flux scale variations: 

1) The absolute calibration of the third 3HCON was revised 
by a few percent after the release of the PSC (and FSS). 
The effect of this revision on PSC fluxes would be to change 
those in the 75% of the sky covered by 3HCONS by 1%. 

2) Whenever the satellite entered the South Atlantic 
Anomaly, radiation hits altered the sensitivity of the detec- 
tors, and data taken during such times were discarded (ES 
III.C.4). However, this leaves the possibility of Malmquist 
effects, and also data taken near the boundaries of the 
SAA potentially suffers residual effects. We have investi- 
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Figure 7. Source counts for all high latitude sky (filled circles), 
high latitude 2HCON sky (open circles), and area with potential 
SAA problems (triangles). 

gated this by checking the source counts for declinations 
—40° < S < 10°, where data taken is most likely to be af- 
fected. We see no evidence for any variation (Figure 7) above 
the few % level in source counts. 

3) Whenever the satellite crossed the Galactic Plane, or 
other very bright sources, the detectors suffered from hys- 
teresis. This effect was investigated by Strauss et al. (1990). 
They found that the likely error is typically less than 1% 
and always less than 2.2%. This is confirmed by the con- 
stancy, to within a few percent, of our galaxy source counts 
for identified galaxies in the PSCz as a function of iioo- 
Overall, differential source densities across the sky due to 
incompleteness, Malmquist effects and sensitivity variations 
are not believed to be greater than a few percent anywhere 
at high latitudes for z < 0.1. Tadros et al. (1999) found an 
upper limit to the rms amplitude of large scale, high latitude 
spherical harmonic components to the density field of the 
PSCz of 3%. Since this is close to the expected variations 
due to clustering, the variations due to non-uniformity in 
the catalogue must be smaller than this. At lower latitudes, 
variations are estimated to be no greater than 10% for z < 
0.05. 



5 COMPARISON WITH THE 1.2 Jy SURVEY 

The 1.2 Jy survey of 5500 IRAS galaxies (Fisher et al. 1995) 
used looser colour criteria than the PSCz, and as such acts 
as a valuable check on the efficacy of our selection procedure 
from the PSC. We find that the 1.2 Jy survey contains 11 
galaxies that have been excluded by our selection criteria. It 
contains a further 5 galaxies that satisfy the PSCz criteria, 
but are not included due to programming and editing errors, 
either during construction of the QIGC survey or its various 
extensions and supplements to form the PSCz. Extrapolat- 
ing these numbers to lower fluxes, we can expect that about 
50 PSC galaxies are missing altogether from the catalogue. 
Conversely, the conservative selection criteria for the 1.2 Jy 
survey led to much greater levels of contamination by cirrus 
and other Galactic sources than in the PSCz. In the 1.2 Jy 
survey, these were eliminated by visual inspection of sky sur- 
vey plates; inevitably real galaxies occasionally got thrown 
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Figure 8. Source counts for identified PSCz galaxies as a function 
of 100/^m background, iioo < 7.5MJyster — 1 (filled circles), 7.5- 
15 (open circles) and 15-22.5 (triangles). 



out by mistake, especially at low latitudes. We have found, 
within the PSCz area, 110 galaxies which are misclassified 
as Galactic in the 1.2 Jy survey, and we have obtained red- 
shifts for 67 of them. Most of the remainder are fainter than 
our bj = 19. 5 m cutoff. There are also known to be to date a 
further 117 galaxies classified as Galactic in the 1.2 Jy sur- 
vey outside the PSCz area. These were found as part of the 
ongoing 'Behind the Plane' extension of the PSCz to lower 
latitudes described in more detail in Saunders et al. (1999). 



6 ANCILLARY INFORMATION 

Along with the PSC data, the catalogue also contains the fol- 
lowing information: POSS/SRC plate and position on that 
plate; the RA,<5, offset, diameters and magnitude of the best 
match from the digitised sky survey plates; name, magnitude 
and diameters from UGC/ESO/MCG, PGC name, de Vau- 
couleurs type and HI widths where available; most accurate 
available redshift and our own redshift measurement; classi- 
fication as galaxy /cirrus/etc; estimated lino and extinction; 
addscan flux and width when treated as an extended source, 
and point source filtered addscan flux. 



7 ACCESSING THE CATALOGUE 

The data is available from the CD S catalogue service 
(tittp://cdsweb.u-strasbg.fr/Cats.html). Full and short ver- 
sions of the catalogue, maskfiles, description files, format 
statements and notes, are also available via the PSCz web 
site 



( tittp: / / www-astro.physics.ox.ac.uk/-~ 



anonymous 



ftp 



-wjs/pscz.html), or by 
from 



ftp: / / ftp- astro. physics. ox. ac.uk /pub / users /wjs/pscz / 
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